04:00
2026-06-16
arxiv.org
large-language-models
Are Online Skill and Memory Modules Always Worth Their Tokens? A Budget-Constrained Study of Web Agents
A new study finds that online web agents augmented with memory, workflow, or skill modules often fail to outperform a token-matched vanilla baseline under a fixed inference budget. Testing across thre…